Multi-GPU thermal lattice Boltzmann simulations using OpenACC and MPI

Abstract

We assess the performance of the hybrid Open Accelerator (OpenACC) and Message Passing Interface (MPI) approach for thermal lattice Boltzmann (LB) simulation accelerated on multiple graphics processing units (GPUs). OpenACC accelerates computation on a single GPU, and MPI synchronizes information between multiple GPUs. With a single GPU, the two-dimensional (2D) simulation achieved 1.93 billion lattice updates per second (GLUPS) with a grid number of $8193^{2}$, and the three-dimensional (3D) simulation achieved 1.04 GLUPS with a grid number of $385^{3}$, which is more than 76% of the theoretical maximum performance. On multiple GPUs, we adopt block partitioning, overlapping communications with computations, and concurrent computation to optimize parallel efficiency. We show that in the strong scaling test, using 16 GPUs, the 2D simulation achieved 30.42 GLUPS and the 3D simulation achieved 14.52 GLUPS. In the weak scaling test, the parallel efficiency remains above 99% up to 16 GPUs. Our results demonstrate that, with improved data and task management, the hybrid OpenACC and MPI technique is promising for thermal LB simulation on multiple GPUs.
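The GLUPS figures quoted in the abstract follow the usual lattice Boltzmann throughput definition: lattice nodes updated per second, divided by $10^{9}$. The sketch below is illustrative only and is not taken from the paper. The function names `glups`, `seconds_per_step`, and `parallel_efficiency` are hypothetical helpers, and the efficiency formula is the standard strong-scaling definition, which the abstract does not spell out.

```python
def glups(grid_points: int, steps: int, seconds: float) -> float:
    """Throughput in billion lattice updates per second (GLUPS)."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return grid_points * steps / seconds / 1e9


def seconds_per_step(grid_points: int, rate_glups: float) -> float:
    """Wall-clock time of one time step implied by a given GLUPS rate."""
    if rate_glups <= 0:
        raise ValueError("rate_glups must be positive")
    return grid_points / (rate_glups * 1e9)


def parallel_efficiency(rate_n: float, rate_1: float, n_gpus: int) -> float:
    """Strong-scaling efficiency: speedup on n_gpus divided by n_gpus.

    Assumption: rate_1 and rate_n are measured on the same problem size.
    """
    return rate_n / (n_gpus * rate_1)


if __name__ == "__main__":
    # Grid sizes and single-GPU rates are the ones quoted in the abstract.
    for label, points, rate in [
        ("2D, 8193^2", 8193**2, 1.93),
        ("3D, 385^3", 385**3, 1.04),
    ]:
        ms = seconds_per_step(points, rate) * 1e3
        print(f"{label}: {points} nodes, {ms:.1f} ms per time step")
```

Reading the rates this way gives a feel for scale: at the quoted single-GPU throughput, one time step on either grid takes a few tens of milliseconds. Applying `parallel_efficiency` to the 16-GPU figures would require knowing the single-GPU baseline for the same grid, which the abstract does not state.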

Similar resources

Accelerating the D3Q19 Lattice Boltzmann Model with OpenACC and MPI

Multi-GPU implementations of the Lattice Boltzmann method are of practical interest as they allow the study of turbulent flows on large-scale simulations at high Reynolds numbers. Although programming GPUs, and in general power-efficient accelerators, typically guarantees high performances, the lack of portability in their low-level programming models implies significant efforts for maintainabi...

Lattice Simulations using OpenACC compilers

OpenACC compilers allow one to use Graphics Processing Units without having to write explicit CUDA code. Programs can be modified incrementally using OpenMP-like directives, which cause the compiler to generate CUDA kernels to be run on the GPUs. In this article we look at the performance gain in lattice simulations with dynamical fermions using OpenACC compilers.

Computational Science and Engineering (Int. Master's Program): MPI Parallelization of GPU-based Lattice Boltzmann Simulations

I hereby declare that this thesis is entirely the result of my own work except where otherwise indicated. I have only used the resources given in the list of references. Christoph Riesinger and Martin Schreiber for their ongoing support of my work, for helpful discussions and encouragement throughout the time.

Multi-GPU Implementation of a Hybrid Thermal Lattice Boltzmann Solver using the TheLMA Framework

In this contribution, a single-node multi-GPU thermal lattice Boltzmann solver is presented. The program is based on the TheLMA framework which was developed for the purpose. The chosen implementation and optimisation strategies are described, both for inter-GPU communication and for coupling with the thermal component of the model. Validation and performance results are provided as well.

Flow Simulations Using Two Dimensional Thermal Lattice Boltzmann Method

Lattice Boltzmann method is implemented to study hydrodynamically and thermally developing steady laminar flows in a channel. Numerical simulation of two-dimensional convective heat transfer problem is conducted using two-dimensional, nine directional D2Q9 thermal lattice Boltzmann arrangements. The velocity and temperature profiles in the developing region predicted by Lattice Boltzmann method...

Journal

Journal title: International Journal of Heat and Mass Transfer

Year: 2023

ISSN: 1879-2189, 0017-9310

DOI: https://doi.org/10.1016/j.ijheatmasstransfer.2022.123649